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Abstract 

We investigate the variance of the optimal ahgnment score of two independent 
iid binary, with parameter 1/2, sequences of length n. The scoring function is such 
that one letter has a somewhat larger score than the other letter. In this setting, 
we prove that the variance is of order n, and this confirms Waterman's conjecture 
in this case. 

1 Introduction 

The problem under investigation in this article is similar to the corresponding prob- 
lem for the Longest Common Subsequence (LCS), which is, in fact, a special case 
of optimal alignment. It is well known that the determination of the order of the 
fluctuations for the LCS-problem is one of the main problem in computational bi- 
ology, and to this day it remains open. As it will be explained below, the optimal 
alignment problem can be formally viewed as a last passage percolation problem 
with correlated weights. For first and last passage percolation the exact order, in 
the general case, is still unknown. In some special cases it has been found as well as 
for the Longest Increasing Subsequence (LIS) problem, where the exact asymptotic 
distribution is known. In turn, the LIS problem is itself asymptotically equivalent 
to a first passage percolation problem on a Poisson graph. 

We will return later to the history of these questions and of related ones but, 
first, let us explain the applications we have in mind. Optimal alignment has gained 
tremendous significance in both computational biology and computational linguis- 
tics. The reader will find in the standard references [8], [10], [19], and [20] for a 
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general discussion of the relevance of string comparison, and related problems, in 
computational biology. 

Optimal alignments and closely related methods are one of the main tools for identifying 
genes. For example, with the help of optimal alignment one searches for the location of gene 
X. Assume that we have already identified the location of related genes. The related genes 
will exhibit a certain amount of similarity. We hunt through the genome in the hope of 
finding a spot that resembles X. To evaluate the degree of similarity, we use an alignment 
score. A high alignment score indicates high similarity. 

In computational linguistics, a quite common task consists in determining, typically 
across languages, related pieces of texts or similar words. Often, one needs to identify pairs 
of translated words as this is an important step towards building electronic lexicons or a 
translation machine. Suppose we are given two texts: the first text is in French, while the 
second text is in German. The two texts are translations of one another and we try to build 
a computer program able to determine which words are translation of one another. Many 
translation pairs exhibit great similarities. Our program should be able to automatically 
detect such similarities without prior knowledge of the languages. Let us present a concrete 
English/ German example: Let X be the English word brother and let Y be the German 
translation Y = bruder. Looking at the two words, we immediately observe a great degree 
of similarity between them. The computer should also be able to detect this resemblance. 
A first, unsophisticated method consists in writing one word beneath the other and to count 
the number of coinciding letters. We then find: 



b 


r 


o 


t 


h 


e 


r 


b 


r 


u 


d 


e 


r 





Two letters coincide: both words start with the letters br. This is not yet very conclusive. 
For example the words brag, bread, breast, bribe, bride, bring, broad, brute, all start with 
the letters hr. Hence this method of alignment is not very powerful to help discriminate 
unrelated pairs of words across similar languages. A better solution consists in aligning 
the two words allowing for gaps whilst trying to obtain the maximum possible number of 
coinciding letters. With this method, the optimal alignment turns out to be: 
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This time we get a sequence of four coinciding letters: brer. Note that brer is a common 
subsequence of X and Y . This means that the word brer can be obtained from both X 
as well as from Y by only deleting letters. It turns out that hrer is the Longest Common 
Subsequence (LCS) of X and Y . (This means that brer is the longest sequence which is 
a subsequence of both X and Y .) A further improvement consists in also allowing the 
alignment of similar letters. For example, give a score of one for identical letter, but a score 
of 0.5 when the letters are only similar. Assuming that t and d are similar, and that so are 
o and u, we find the following optimal alignment: 
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(1.1) 



The score of the above alignment is 4 + 2(0.5) = 5, since four identical letters are aligned 
as well as two similar ones. The score of the alignment (|l.ip can be written as: 

s(6, b) + s(r, r) + s(o, u) + s{t, d) + s(e, e) + s(r, r) = 4 + 2(1/2) = 5, 

where s{x, y) denotes the score obtained by aligning the letter x with the letter y. Sometimes 
a gap penalty is also in use. Note that the length of the LCS is equal to the optimal alignment 
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score when the substitution matrix s is taken to be the identity matrix and that a zero gap 
penalty is in force. Summarizing: the total score of an alignment is the sum of terms for 
each aligned pair of residues, plus a usually negative term for each gap, the so-called gap 
penalty. 

Throughout this paper the texts X and Y are iid binary random text indepen- 
dent of each other and both have length n. The optimal alignment score is denoted 
by L„. The substitution matrix is such that s{l,l) is somewhat larger than the 
other single- letter scores. In practice this corresponds to a situation where one let- 
ter is more stable than the other letters. The main result of the present paper is 
that VarLfi is of order n, when the score s(l, 1) is taken large enough. 

As already mentioned, the study of the fluctuations for the LCS is one of the 
main problems in computational biology. For the LCS-case, Steele [21], [22] proved 
that VarL„ < n, while Chvatal and Sankoff conjectured that VarL„ is of order 
o(n^/^), which is the same order (when properly rescaled), as the one obtained 
by Baik, Deift and Johansson [7] in their much celebrated result on the Longest 
Increasing Subsequence (LIS) of a random permutation. (See, [13], for more precise 
references to this problem.) In turn. Waterman [23j conjectured that in many cases 
the variance of L„ grows linearly. A similar order is obtained by p6] when one 
sequence is not random but rather periodic. 

In the LIS-problem, one picks randomly a permutation of {1, 2, . . . , n} and con- 
siders the LIS of the sequence of permutated numbers. The LIS-problem is asymp- 
totically equivalent to a Last Passage Percolation (LPP) problem on an oriented 
Poisson graph. This corresponds to collecting as many Poisson points, in a square, 
when one is only allowed to move rightward or upward. The Poisson point process is 
then homogeneous. As we will argue later, the optimal alignment problem is equiva- 
lent to a Last Passage Percolation (LPP) problem with correlated weights. For First 
Passage Percolation (FPP) and LPP, the order of the (longitudinal) fluctuation is 
not yet know in full generality, but some interesting results are available. Our result, 
also proves that the behavior of optimal alignment scores can be entirely different 
from the typical LPP and LIS-situations, despite the fact that optimal alignment is 
formally a LPP-problem. 

Let us quickly explain how optimal alignment can be viewed as a LPP-problem. 
The LPP problem consists in going in an oriented graph from point A to point B. 
Each edge e is assigned a weight w{e), which represents the time it takes to cross 
e. In the LPP problem, one is interested in finding the path which takes the most 
time to go from A to B. (In First Passage Percolation (FPP) one searches for the 
shortest path on a non-oriented graph.) In the optimal alignment problem, the set of 
vertices V is N^. Let ei := (1,0), 6*2 := (1,1), 63 := (0,1) and let Ea ■= {61,6*2,63}. 
Let the weight of the edge w{x, y) be equal to the gap penalty when y — x = ei or 
y — X = 63. When, y — x = eb, then w{x, y) is equal to the alignment score s{Xi,Yj) 
where y = and the aligned texts are X = X1X2 ■ ■ ■ Xn and Y = Y1I2 • • • Yn- 
Note that here the weights are correlated which is in stark contrast to classical LPP 
and FPP where the weights are mostly taken to be iid. In the optimal alignment 
problem the weights s{Xi,Yj) are function of two one dimensional texts. Hence, 
unlike FPP and LPP the optimal alignment problem is an hybrid problem with 
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some aspects being two-dimensional and others being one-dimensional. 

Let us now discuss a little bit more some of the history of LCS-problems. Using 
a subadditivity argument, Chvatal and Sankoff 9j prove that the limit 

7 := lim 

n— >oo fi 

exists, where L„ is the length of the LCS of two independent iid sequences of length 
n. To this day, the value of the Chvatal and Sankoff constant 7 > 0, is unknown, 
even for binary sequences. Waterman and Arratia [5j derive a law of large deviations 
for the fluctuations of L„ on scales larger than ^/n. In this groundbreaking article, 
they show the existence of a critical phenomenon. Using first passage percolation 
methods, Alexander [Ij proves that E[Ln]/n converges to 7 at a rate of order at 
least y^log n/n. In [23j, Waterman studies the statistical significance of the results 
produced by sequence alignment methods. 

The determination of the order of the the variance for the LCS problem is what 
will be of concern to us in the rest of the text. 



2 The Main Result 

Throughout this paper {XjjjgN and {lijieN are two independent iid sequences of 
Bernoulli random variables with parameter 1/2. We also assume in everything that 
follows that the substitution matrix is such that: 

s(l, 0) = s(0, 1) = and s(0, 0) = 1, (2.1) 

while the gap penalty is taken equal to zero: 

q = (2.2) 

We consider the two sequences of equal length X := X1X2 ■ ■ ■ Xn and Y := Y1Y2 . . . 
An alignment is a pair of increasing sequences (vr, v) such that 

1 < 7r(l) < 7r(2) < ... < TT{k) < n 

and 

1 < viX) < ^^(2) < ••• < v{k) < n, 

where vr = 7r(l)7r(2) . . . 7r(/c), v = z^(l)i/(2) . . . i^(A;) and k < n. The score of the 
alignment (vr, u) is defined as 

k 

Sin,u) ■.= Y.s{X{7T{i)),Y{um. 
1=1 

The optimal score is defined by 

Ln := maxS'(^ ,^), 

where the maximum is taken over all possible alignments (vr, v). 

The main result of the present paper asserts that if the score s(l, 1) is large 
enough, then the variance of the optimal score is of order n. More precisely. 
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Theorem 2.1 Let {XjjjgN and {l^jieN be two independent sequences ofiid Bernoulli 
random variables with parameter 1/2. Let the substitution matrix be such that 
s(l,0) = s(0, 1) = 0, s(0, 0) = 1 with moreover no gap penalty, i.e., q = 0. Then, 
there exist c > and s > 0, such that if s{l, 1) > s, then 

VarL„ > c • n. (2.3) 

for all n G N. 

The last theorem imphes that VarL„ = G(n), using the upper bound obtained, via 
the tensorization property of the variance, in [2T], |22j . 

The main idea developed in proving the above theorem, is to show that when 
changing the length of a randomly chosen block in X, then this process has a 
tendency to increase the score. Since the number of blocks of a certain length has 
fluctuations of order 0(n) this, in turn, implies that VarL„ is of order Q{n). 

Let us now explain what a block of X is, starting with a formal definition. 

Let < i < j < n. The interval [i,j] is called a block of zeros (resp. block of 
ones) in X if 

Xi = = ... = Xj = 0, {resp. = 1) 

but = 1 (resp. 0) (or i = 1) and ^j+i = 1 (resp. 0) (or j = n). The integer 

J — i + 1 is then called the length of the block [i,j]. 

Here is a numerical example: Let X be equal to the string: 000011100000. The first 
block in this case consists of four zeros and has length four. The second block consists of 
three ones and has length three. The third block is a block of zeros of length five 

Next, let us described our transformation: We pick a block of zeros of length 
five at random among all the blocks of zeros of length five in X. For this we use the 
equiprobable distribution and this selection process is independent of Y. Then, we 
take off one zero from the chosen block. The block becomes a block of length four. 
The next step is to add the zero just removed to a randomly chosen block of zeros 
of length one. This block then becomes a block of length two. Again, to chose the 
block of length one, we use the equiprobable distribution on all blocks of zeros and 
length one in X. The string X gets transformed in this way into a new string with 
same length. This new string is denoted by X. 

Let us, once more, consider a numerical example. Let X = 0101000001000001. The 
string X has two blocks of zeros of length one as well as two blocks of zeros of length five. 
Assuming that the second block of length five gets chosen and so is the first block of length 
one, we would obtain: X = 001010000010000. 

The optimal alignment score of X with Y is denoted by L^. Hence: 

k 

in :=maxJ^s(A(^(i)),y(z.(i))), 
1=1 

where the maximum is taken over all alignments (tt, v). 

We show that, when s(l, 1) is taken large enough, then L„ tends to be larger than 
Ln- This is the content of the next theorem: 
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Theorem 2.2 Assume that ()2.ip and (j2.2p 6oi/i /loW. T/ien, there exist ei,ci, s > 
anc? s > all independent of n, such that if s{l, 1) > s, then 

P(^") > 1 - e-'^i", (2.4) 

for all n G N, where is the event that the following two equations both hold: 

SI 1 

P(L„-L„ = l|X,y)>-.--6i, (2.5) 

P(L„-L„ = -l|X,y) < ^ + ei. (2.6) 

Let us explain the main idea behind the previous theorem: when s{l, 1) is large 
most of the ones will get matched with ones. So, we are close to a situation where 
we match all the ones and match as many zeros in between ones as possible. For an 
alignment which tries to match all the ones, the blocks of zeros between matched 
ones are iid. The distribution of the length of the blocks of zeros between matched 
ones is geometric with parameter 1/2. 

Let us look at another numerical example of an alignment obtained by matching all the 
ones. Take X = 1011000001 and y = 10010101. When we match all the ones and as many 
zeros as possible in between matched ones, we obtain the following alignment: 

10_1. 1000001 
1001010. .__1 

In our current example, the first block of zeros of X has length 1 and the third has length 
5. If we shorten the third block by removing a zero and adding it to the first block the 
alignment-score gets increased by one. The reason for this is that the third block of zeros of 
X is matched with a shorter block so removing a zero does not reduce the score. However 
the first block of zeros of X is matched with a longer block of Y so adding a zero increases 
the score by one unit. When we take a bit from the third block to add it to the first block 
of zeros of X, we get the following new alignment: 

1 1 . 1 1 
1 1 1 ... 1 

This alignment has score 7. This is one unit larger than the score of the alignment before 
modifications. 

In general the alignment which aligns all ones is not an optimal alignment. It is very 
difficult to understand how the optimal alignment looks macroscopically for long 
texts. There are extremely complicated dependencies between all the parts of the 
optimal alignment. But despite the horrendously complicated macroscopic behavior, 
when most ones are matched, the local distribution of the optimal alignment is close 
to the alignment where we match all the ones. Furthermore, the alignment where 
we match all the ones consists of a sequence of iid blocks of zeros between matched 
ones. 

For the alignment with all ones matched, we have that: We can compute the 
probability that the score increases by one when we take a zero from a block of X 
of length five and add it to a block of length one. The block are chosen at random. 
The corresponding blocks in Y are iid and have geometric distribution. Hence the 
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probability to have the ahgnment score increased by one is the probabihty that 
the chosen block of length five is matched with a block of shorter length times the 
probability that the chosen block of length one is matched with a longer block. 
Hence the probability of an increase in score is: 

31 1 

P{Z < 5) X P(Z > 1) = - • -, 

where Z is a geometric random variable with parameter 1/2. 

Similarly, the probability that the score decreases by one unit is the probability 
that the block of length five is matched to a block of length five or longer, times the 
probability that the block of length one is matched to a block of length one or no 
zero. Hence the probability that the score decreases by one unit is 

P{Z > 5) X P{Z < 1) = 
The probability to have the score unchanged is: 

1 3 31 



P{Z > 5) X P{Z > 1) 



32-4 4 32 



Theorem 12.21 savs that the probabilities for the optimal score to increase/decrease 
through our bit transfer procedure, is close to the probabilities for the alignment 
which matches all the ones. (For this we assume that most ones are matched due 
to the score s(l, 1) being large). Theorem 12.21 is proved in Section 4 and Section 5 
using exponential estimates and some combinatorics. In the next section, we prove 
that Theorem 12.21 implies Theorem 12. li 



3 Proof of the Main Theorem 

The purpose of section is to prove that Theorem 12.21 implies our main Theorem 12. 1[ 
We first need a few definitions. 

Let Ni be the number of blocks of zeros of length i in the string X = X1X2 ■ ■ ■ Xn, 
and let N be the vector N = {Ni, N2,N4, N^). Let Ha = {(ni, 0, 71.4, n5)|ni, n4, G 
N} and let Hh = {(ni, n2, 0, 'ra5)|ni, n2, ns G N}. Let e := (—1, 1, 1, —1). It is easy to 
check that every integer vector n = (ni, n2, 71.4, ns) G N'* can be written in a unique 
way as a vector of Ha U Hb plus a term n ■ e where n G N. 
For every ft G Ha U Hf,, we define a sequence of random strings: 

X{n),X{fi + e),X{n + 2e), . . . , X{n + fee), . . . 

The sequence is defined by induction on k: 

-First, X{n) is a binary random string of length n having distribution C{X\N = ft). 
We also ask that X{n) be independent of Y . 

-Once, X{n + ke) is defined, let X{n + {k + l)e) be the string obtained by taking 
one zero from a block of length five and adding it to a block of length 1. For this 
we draw one block among all the blocks of zeros of X{n + ke) of length five and 
then we pick another block of zeros of length one in X{n + ke) and turn it into a 
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block of length two. The blocks are drawn with equal probabihty among all blocks 
of length five, resp. one. 

Let us look, once again, at a numerical example. Take n = 16 and n = (2, 0, 0, 2). Hence 
the string X{n) in this case, is a binary random string of length 16. It distribution is the 
conditional distribution of an iid string, given that there are: 
-two blocks of zeros of length one, 
-no blocks of zeros of length two or four, 
-two blocks of zeros of length five. 

We generate the string X{{2, 0, 0, 2)) with this distribution. We could get for example 

X((2,0,0,2)) = 0101000001100000. 

To obtain the next string, that is ^((1, 1, 1, 1)) wc pick one of the blocks of length five at 
random and turn it into a block of length four. Then, we add a zero to one of the block of 
length one. There are two blocks of length five, so they have both equal probability to get 
chosen. Assume that the randomly chosen block of length five is the second one. Assume 
that the randomly chose block of length one turns out to be the first block. Then, we would 
have that 

X((l, 1, 1, 1)) = 0010100000110000. 



Let L{'n) be the optimal alignment score of X{n) and Y. 
The next lemma says that for any m G N^, such that 

P{-m = iV) > 0, 

we have that L(m) has the distribution of L„ conditional on N = m. 
Lemma 3.1 Let m G N'^ be such that 

P{fh = N)>0. 

Then, L{rh) has distribution 

C{Ln\fn = N). 

Proof. We need to prove that for any m G with P{N = m) > 0, we have that 

C{X{m)) = C{X\N = m). 

The proof is by induction on k. By definition, we have that X{fi) has distribution 

C{X\N = n) when n e HaLI Hi,. 

The next step is to prove that when X{n + ke) has distribution: 

c(^X\N = n + kej , 
then X(n + {k + l)e) must have distribution 

C (^X\N = n + {k + . 
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Let m G be such that P{N = m) ^ 0. The distribution C{X\N = m) can be 
characterized as the equiprobabihty distribution on the set of strings x for which 

N{x) = rh. 

Here N{x) G is the vector (nf , nf, raf, rig), where nf denotes the number of blocks 
of zeros of length i in the string x. 
Let n* be the vector n* := n + ke. Let 

So we only need to prove that all the possible realizations of X{n* + e) are equiprob- 
able and that for every x in the set 

{xG {0,ir|iV(x) =r + e}, (3.1) 

the probability 

P{X{n* +e) =x), 

is non zero. 

Any string x, such that N{x) = n* + e has a non-zero number of blocks of zeros of 
length two and of length four. Take any of these blocks of length four and any block 
of length two. Let y be the string obtained by reducing the chosen block of length 
two by one bit and adding this bit to the block of length four. By the induction 
assumption, the probability for X(n*) to be equal to y is non-zero. The conditional 
probability that X{n* + e) = x given X{n*) = y, is equal to 1/(7x571^) which is 
different from zero. Hence, P{X{n* + e) = x) is larger than zero since 

It remains to prove that for all x such that 

N{x) = n* + e, 

we have that P{X{n* A-e) = x) does not depend on x. By the law of total probability 
we have that 

P{X{n* +e)=x) = Y^ P{X{n*) = y) ■ P{X{n* + e) = x\X{n*) = y). (3.2) 
y 

The sum in the equality above is taken over all strings y which can be transformed 
into the string x by taking a bit from a block of zeros of length five and adding it 
to a block of length one. For each pair consisting of a block of zeros of length four 
and a block of zeros of length two of x there is such a y. Hence the number of terms 
in the sum on the right side of the equality (|3.2p . is equal to 

nf X nl = {nl - l){nl - 1). 
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By the induction assumption, for a string y such that N[y) = ft*, we have that 
P{X{n*) = y) does not depend on y. 

The probabihty for y to get transformed into x, is equal to: 
PiX{n* + e)= x\Xin*) = y) = 

Hence (j3.2p becomes: 

+e)=x)= P{X{nn = y) ■ + + (3.3) 

where y is any string such that N{y) = n* . As aheady mentioned, by the induction 
assumption, for such a y, we have that P{X{n*) = y) does not depend on y. Hence, 
the expression on the right side of ()3.3p does not depend on x. This finishes the 
proof. ■ 



We assume that the random variables L[n) were constructed so that they are 
independent of N . Then L{N) has same distribution as L„. Hence, 

Var[L„,] = Var[L(iV)]. (3.4) 

We mentioned that any vector of can be represented in a unique way as a sum: 
n:=mi + ke. Where mi e Ha + Hh and k eN. Let M e HaU Hh and Mi G N be 
defined by the equation: 

N = M + Mie. (3.5) 
We also write L{Mi,M) for L{N). 

Let H"^ be the intersection of Ha U Hf, with B{0, \/n). (Here B{0, y/n) stands for 
the ball of radius y/n centered at 0.) 

Let H2 := {ke\k G [—y/n, V^}- Let /2 := (/ii, /U2, ^4, fJ-5) where 

/ii := E[Nl],^i2 := E[N2], ^14 := E[N4],fi5 ■= ElN^]. 

Let 

r :=C ■ H"" + C ■ H2 + /2, (3.6) 
where C > is a constant not depending on n, chosen large enough so that 

fl+ [-^55n/4, V55n/4]^ C I", (3.7) 

holds. Let E^^^^^ be the event that for all n^, fib G such that fih — fia = ke, with 
k > n^'^, we have that 

L{nb) - L{na) > 0.01|nfe - na\. 

For any random variables V and W, as usual Var[y|VF] = - (£'[y|M^])^, 

hence 

Var[F] = E[\aT[V\W]] +\aT[E[V\W]], 
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and so 

\ar[V] > E[YaT[V\W]]. 

This give in our case, 

Var[L(iV)] > E \ Yai[L{Mi, M)\L{.), M] 1 . (3.8) 
Since the variance is positive, we find 



E 



Var[L(Mi,M)|L(.),M] > (3.9) 



> E 



Var[L(Mi,M)|L(.),M] Ner,E2,,^^ • P(iV G I") • P(i?Oope)- (3-10) 



Note that when we condition on M and hold M fixed, then L{Mi,rn) becomes a 
function of the one dimensional variable Mi. When the event -E^opg holds, that 
function has "positive slope on the scale n'''^". We are hence interested in the 
variance of a non-random function of a random variable. For this, assume that 
f{uj) : M ^ M is such that f'{x) > c, for all x G R (where c > is a constant not 
depending on x). Then, for any random variable Y, we have 

Var[/(y)] > c^Yav[Y]. (3.11) 

Hence, if the map x L(x, M) would have partial derivative along x everywhere 
larger than c > 0, is would follow that Var[L(Mi,m)] is larger than (? • Var[Mi|M]. 
Typically, the integer map x i— > L(x, M) is not strictly increasing in the direction of 
e. But it is likely that the event E'^^^^^ holds, and hence there is a linear increase in 
the direction of e every nP'"^ points. 

We are next going to formulate a lemma which is a modification of (j3.1ip . for when 
the map /(w) does not increase every A;, but has a tendency to increase on some 
scale: 

Lemma 3.2 Let c,m > be two constants. Let f : TL ^ 7L he a non decreasing 
map such that: 

• for all i < j: 

f{j)-f{i)<ij-i) (3.12) 

• for all i,j such that i + m < j: 

fij)-f{i}>c-{j-i). (3.13) 
Let B he an integer random variable such that E\\f{B)W < +oo. Then: 

We can now use Lemma 13.21 to give a lower bound on Var[L(Mi, M)|L(.), M] when 
-^siope holds and assuming that we condition on G 7". When E^^^^ holds, the 
integer map 

X ^ L{x,M), 



11 



restricted to {x|xe + M G /"} satisfies the conditions of Lemma 13.21 with m = nP-^^ 
and c = 0.01. Hence, we obtain that conditional on G and when -E^^pg holds, 
then 

(2^0.1 
1 -^^^=^^^= I Var[Mi|M, N eP 
0.01^Var[Mi|Al,iV G /"] 

(3.15) 

Next we need the following lemma: 

Lemma 3.3 Assume that W is a random variable taking its values in Z. Assume 
that K > is a constant independent of n. Let J he an interval of diameter at least 
3Kln2\/n, n>l, such that P{W G J) = 1. Assume that 

P{W = i + l)>P{W = i). ^ (3.16) 

and 

P(W = i)>P{W = i + l)- (l- , (3.17) 



for all i,i + 1 G J. 

Then, 

Var[VF] > n . (3-18) 

for every n large enough. 

Proof. Let / be the interval 

/ := [ E[W] - Vnln2/{2K) , E[W] + ^In2/(2K) ]. 

Let Ir (resp. I;) denote the interval of length In2y^/K directly adjacent to the right 
of / (resp. to the left of /). Then either P <Z J or Ii <Z J. Let us assume that P C J 
and leave the other case to the reader. 
For every i G /, we have 

i + In2y^/K G P. 

(To simplify notation, we assume that ln2^/n/K is a natural number.) Let j := 
i + ln2^/n/K. By the assumption (13.171) . we find that 

/ . \ In 2^/n/K 

P(W = j)>P{W = i)- [l-^] . (3.19) 



/n. 

The last inequality above yields 

^ In 2y/n/K 

P{W G P) > P{W G /) • ( 1 - 

V \/n 



and so 

/ ^ \ ln2Vn/K 

P{W il)> P{W G /) • 1 - ^ . (3.20) 
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Note that 

N In 2v^/k , 



and thus for n large enough, 



n I ^3 



The last inequality together with ()3.20p . yields 
from which it follows that 

p(w n> ^ 



P{WiI)>\. (3.21) 



We have that 



Var[VF] = ■ (i - E[W])^ > ^ p^ ■ {i - E[W])^, (3.22) 

ieJ ieJ\I 



where pi := P(W = i). 
Note that for every i ^ I, 



{i-E[W]f > 



nln2^ ^ 



2k 

Hence, the right side of inequality (j3.22p is larger than 

fin 2^12 

Using this with the inequality (I3.21[) in (|3.22p . we obtain that 

^ ^ - 16«;2 



The next lemma will also be useful: 

Lemma 3.4 There exists a constant cm > 0, not depending on M or n such that 

Yav[Mi\M,N e r']>CM -n (3.23) 
for all M, such that P{M,N £ I") > 0. 

Proof. Assume that n = (^i, n2, ?t.4, ^5) G I". Let mi € N and m € Ha U Hf,, be 
such that 

n = m + mie. 
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(Note that the last equahty uniquely determines m and mi). 
Let Ii be the integer interval such that 

(N • e + m) n = /i • e + rh. 

We also write I{fh) to indicate that Ii only depends on m. Note that the condition 

M = m,N £ P, 

can now be rewritten as 

M = m,Mi G Ji. 
The equation (|3.3p can be rewritten as: 

P(M = m, Ml = mi + 1) _ (n2 + l)(n4 + 1) 
P(M = m,Mi = mi) nirir^ 

and so 

P{Mi =mi + 1\M = m, Ml G /i) (n2 + l)(n4 + 1) 



(3.24) 



(3.26) 



(3.25) 

P(Mi = mi|M = m,Mi G Ii) "i?^5 
We have that there exist a,b,c,d > not depending on n, such that the following 
four inequalities hold 

1/^1 — n| < a\/n, 

I/U4 — r^l < c\/n, 
\H5 - rsl < d^/n, 

for any r = (ri, r2, r4, rs) G 

Hence there exist a*,b*,c*,d* > 0, such that 

(n2 + l)(n4 + 1) ^ (At2 + yV7I)(;U4 + c*V^) 
nins (//i + a* y/n) (/is + d* y/n) ' 

and such that \a*\ < a, \b*\ < b, \c*\ < c and \d*\ < d. Note that 

n 

/^i = ^, 

which implies that 

{H2 + b*./E){fii + c*^) ^ i0.5^ + b*/^){0.5^ + c*/^) 
(/ii + a*^/n){n5 + d*^/n) {0.5^ + a*/^/n){0.5^ + d*/^/n) ' 

Let 

(0.52 + x)(0.5^ + y) 

r : (x,y,z,w] ^ ^ -. -. 

^ ' ' (0.51 + z) (0.54 + 71;) 

Note that the maps / and 1/f are both continuously differentiable in an open 
neighborhood of = (0,0,0,0) and that /(O, 0,0,0) = 1. This implies that there 
exist a constant k > not depending on n, such that for all n large enough, we have 

' ^ (0.5^+6VV^)(0.54 + cVV^) ^ 



(3.27) 



Ky/n ~ (O.51 +a*/\/ra)(0.55 +d*/\/n) 
14 



which holds for every a* ,b* ,c* ,d* such that \a*\ < a, \b*\ < b, \c*\ < c and \d*\ < d\. 
Combining, l^2B, dSSS]), 1^^^ and (|3:28D . we find that 

P(Mi = mi + IIM = m,Mi£ h) ^ 

> 1 — K.yn^ 

P{Mi = mi\M = m,Mi £ h) 

and 

P(Mi = mi\M = m, Mi € h) r- 

> 1 — K\/n. 



P{Mi = mi + l\M = m, Mi G h 

Note that the two last inequalities above are nothing but the conditions (j3.16p and 
()3.17p in Lemma 13.31 For this we take for the random variable W of Lemma 13.31 
the random variable Mi conditional on M = m and Mi G Ii. In order to apply 
Lemma 13.31 we also need to verify that the interval I{m) has diameter of at least 
3Kln2\/n. For this, note that by choosing the constant C > in the Definition 13.61 
of large enough (but not depending on n), we get that the diameter of I{m) is 
larger than 3k In for every m, such that 

P(M = m, iV E I") > 0. 

We can now apply Lemma [33| and obtain that 

(In 2)2 



Var[Mi|M = m, Mi E h] = Var[Mi|M = m,N e r]>n 



16/t2 



Next, applying inequality p.23p to ()3.15p . we find that conditional on E 
and when -E^ope holds, 

Var[L(Mi,M)|L(.),M] >n - fo.Ol^ (l - J'''^'' ) cm) . (3.29) 

Using (I3.29p . we see that (13. 9j) becomes: 



E 



Var[L(Mi,M)|L(.),M] > (3.30) 



> n • ( CM O.Ol^ ( 1 - ^^0 , ) ) • G /") • P(i^riope)- (3.31) 



0.0iyS]^7n' 

With the help of and of ([O]) . we then find 



Var[L„] > n • [cu O.Ol^ - . P(N E /") • P(El,^,). (3.32) 

To finish the proof of Theorem 12. II it suffices to prove that P(iV E I") and -P(-E'^ope) 
are both bounded uniformly below by a constant as n goes to infinity. This is the 
content of the two next lemmas. 
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Lemma 3.5 We have that 

p(N £ r) > -. 

5 

Proof. Let Zj be the indicator function which is equal to one if 

Xi = 1, Xi^i = 0, Xij^2 = 1- 

and Zi = otherwise. In other words, = 1 if there is a block of one zero in X 
starting at the position i. We have that 



n— 3 



i=l 

and hence 

n—3 

Yav[N,]=E[{Y^{Z,-E[Z,])f]=Y^E[iZ,-E[Z,]){Z,-E[Zj])]. (3.33) 
t=l i,j 

When \i — j\ > 2, then Zi and Zj are independent of each other and: 

E[{Zi-E[Z,])iZ,-E[Z,])]=0. 

This implies that in the sum on the very right side of (j3.33p , there at most 5n terms 
which are different from zero. By Cauchy-Schwarz, we have 



E[iZi - E[Zi]){Zj - E[Z,])] < ^E[{Z, - E[Z,])^] ■ E[{Z, - E[Z,]y] < Var[Zi]. 

Hence, for all i, j £ [0,n — 3] we have 

E[{Z,-E[Z,]){Zj-E[Z,])]<^ (3.34) 

since there are less than 5n non-zero terms in the sum on the right side of (|3.33|) . 
we obtain with (j3.34p that: 

Var[iVi] < ^ (3.35) 

Similarly we obtain 

Var[iV2] < < ^, Var[7V5] < — . (3.36) 
In (|3.7p . we have chosen /" such that 

fl {NiGim- V55n/4, f^i + ^bbn/A ]} C {iV G } , 

1=1,2,4,5 

and hence 

P{N^n< Pi^i ^il^i- \/55n/4, fii + V55n/4 ] ). (3.37) 

1=1,2,4,5 
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The Bienayme-Chebycheff inequality yields that 

/ij + Y^55n/4] < 



Var[iVi; 



55n/4 



and hence together with (13.360 and ()3.37p . this yields: 



i=l,2,4,5 



4 55n 5 



finishing the proof. 



The next lemma will be useful. Recall that ^li is the expected number of blocks of 
zeros of length z in X. 



Lemma 3.6 For all n and all i, we have that 

^ \ i+2 



< i. 



(3.38) 



Proof. Let Kj be the Bernoulli random variable which indicates if there is a block 
of zeros of length i starting at the point j in X. In other words, R) = 1 



Xjjf-i—i — and Xj—i — -^j+i — 1 



for j £ [2,n — i]. 

For j = 1 we have Kj = 1 

Xi=X2 = ... = Xi = and Xi+i = 1. 

For j = n — i + 1, we have that Kj = 1 

^n-i+l = = ■ ■ ■ = Xn = and 

We find that 

n—i+l 



1. 



N^= ^ 



and hence 



n—i+l 



f,, = E[Ni]= ^ E[R)]. 

Note that for j e[2,n-i], we have that = (l/2)*+2. 

We also have that E[R\] = = (l/2)*+i. 

This with ([339D . yields 

M, = (n-i). (1/2^+2 + 2 •(1/2)^+1. 
This last equality directly implies ()3.38p . 



(3.39) 



(3.40) 



Our next lemma is related to the local limit theorem for multidimensional regener- 
ative processes: 
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Lemma 3.7 There exist a constant ki > such that for all all n, we have: 
for all fi E I^, the inequality 

P{N = n)>n-''\ (3.41) 

holds. 

Proof. Let Tj be the end of the i-th block in the infinite sequence Xi , X2 , , . . .. 
To simphfy notation, we assume that the finite sequence X = X1X2 ■ ■ ■ Xn got 
extended to an infinite sequence of iid BernoulU random variables with parameter 
1/2. In this way, Tj is well defined even when Ti > n. (Note that here wc consider 
all the blocks, and not just the blocks of zeros). Let Zj denote the length of the i-th 
block of X. Hence, we have that Zi = Ti — Tj-i and 

Ti = Zi + Z2 + ... + Zi. 



For a numerical example: Take X = 00111001 . . .. Then find Ti = 3, = 5 and T3 = 8. 

Note that ri,T2,... are the arrival times of a renewal process. The interarrival 
times ^1,^2,... are iid geometric random variables with parameter 2 and hence 
E[Zi] = 2. 

We assume that n/4 is an integer in order to simplify notation. There are n/2 
blocks in X iff T„/2 = n. 

The event that the blocks of ones cover half the text X can be described by the 
equality 

n 

Y.^i = l- (3-42) 

i=l 

Let n = (ni, n2, 714, ns) G i". We have that 

P{N = n)>p(^N = n,J2Xi = ^, T^,^ = ^ . (3.43) 

Let be the indicator variable which is equal to one iff the i-th block of zeros in 
X had length j and V? =Q otherwise. Let 

V,:= {Vl,Vl,Vt,V^). 

Note that Vi^V^.,. . . form a sequence of iid random vectors. 
The event 

|iV = n,fjX, = ^,r„/2 = n| 

can be interpreted as the event that there are n/2 blocks in X and that the blocks 
of ones cover half the text and that N = n. Its probability can be computed as 
follows: First compute the contribution due to the fact that the blocks of ones cover 
half the text. Half of all blocks are blocks of ones. Hence we have exactly n/4 blocks 
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of ones. These blocks are iid with geometric distribution with parameter 1/2. This 
gives us a factor 

P{Z, + Z2 + . . . + Z„/4 = n/2) (3.44) 

Second, we compute the probabihty that among the first n/4 blocks of zeros we 
have the right number of blocks of length 1, 2, 4 and 5. We find 

P(yi + ... + K/4 = n). (3.45) 

Finally we compute the probability that the remaining n/4 — ni — n2 — n.4 — 725 
blocks of zeros, cover up a total length of n/2 — ni — 2n2 — 4n4 — 571.5. This yields 
the probability 

P{Wi + W2 + ... + Wn* =n/2-ni- 2n2 - 4^4 - bn^), (3.46) 

where n* := n/A — rii — n2 — rn — and Wi, W2, ... is a sequence of iid random 
variables with distribution 



CiZi\Z, ^ {1,2,4,5}). 



Summarizing, we find 



P(iV = n,^X, = ^,r„/2 = n) = 
1=1 

P{Zi + . . . + Z„/4 = n/2) ■ P{Vi + . . . + V;/4 = n) • P{Wi + . . . + Wn' = n/2 - n), 

where n = ni + 2n2 + 4n4 + hn^ and hence with (|3.43|) : 

P{N = n)> P{Zi + . . . + Z„/4 = n/2) ■ P{Vi + ■ ■ ■ + V^/^ = n) 
■PiWi + --- + Wn* =n/2-n). 

We have that (n/4) • E[Zi] = n/2. Hence by the local limit theorem, we find that 
there exists k2 > not depending on n such that 



P{Zi + . . . + Z„/4 = n/2) > k2 ■ n-°-5 (3.47) 



for all n. 

Note that 



E[VI] = ( ^ ) ' (3-48) 



and so, together with the inequality (j3.38|) . this yields: 

1/2- (n/4) ■E[Vi]\ < 1 + 2 + 4 + 5 = 12, (3.49) 

where |.| denotes the norm in M'* for which \{x,y, z,w)\ = \x\ + \y\ + \z\ + \w\. 
We have that 

|n - (n/4) • E[Vi] \ < \n - fl\ + \fl - (n/4) • E[Vi] \ . (3.50) 
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Since, n G we have that there exists a constant K > not depending on n or 
n G such that 

\n-fl\<K■^/^ (3.51) 
Combining ([3^ . ([330]) and ([33T]) yields 



\n - (n/4) • £;[Vi]| <K-VN + 12 

which hold for every n and every fi £ I^. From the last inequality above, with the 
help of the local limit theorem, we obtain that there exists > not depending 
on n or n G /" such that 

PiVi + . . . + = n) > ^3 • (3.52) 

Next, we want to prove that 

\n*E[Wi]-n/2 + ni+n2 + n4 + n5\ (3.53) 

is of order ^/n. Let 

n fl 
Uj := — — 
4 V2 

We know there is a constant K > not depending on n or n € such that 

\fi — n\ < K\/n. 

From this and from 13.38] it follows that there exists K2 > such that the difference 
between ()3.53p and 

|(n/4 — ui — U2 — U4 — U5)E[Wi] — n/2 + ui + U2 + + usj (3.54) 

is less than K2\/n. 

Now consider a sequence of n/4 iid geometric random variables with parameter 1/2. 
The expectation of the sum of these random variables is: 

Let W deiiote the sub-sum obtained by only taking the terms not equal to 1, 2, 4 
or 5. Let W denote the sub-sum obtained by taking only those terms in the sum 
which are equal to 1, 2, 4 or 5. 

Let b denote the total number of random variables among our collection of n/4, 
that take a value equal to 1,2,4 or 5. 

Note that the probability for one variable to take on the value i is equal to (1/2)*. 
Hence, the expected number of variables among our set of n/4 which take on the 
value i is equal to (n/4)(l/2)* = n,. We find 

S[T„/4]] = E[W + W]= E[W] + E[W], 

but 

E[W] = E[Wi] • E[{n/A) - b] = E[Wi] • ((n/4) - m - U2 - U4 - U5) 
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and 

E[W] =ui + 2u2 + 4:U4 + 5u5. 
Combining the last three inequahties yields: 

n/2 = £^[T„/4]] = E[Wi] ■ ((n/4) - m - U2 - - U5) + m + 2u2 + ^u^ + 5n5, 

and hence 

= £;[VFi] • ((n/4) -ux-U2-u^- U5) - (n/2) + ui + 2u2 + 4n4 + 5^5. (3.55) 

Thus, the expression (|3.54p is equal to zero and so 

\n*E[Wi] - n/2 + ni + n2 + n4 + ns] < K2 ■ Vn. 

The above inequality combined with the local limit theorem yields that there exists 
k4 > 0, not depending on n or n G /" such that 

P{Wi + W2 + ... + Wn* =n/2-ni- 2n2 - 4n4 - 5n5) > ki ■ n^/^ (3 55) 

The inequalities (I33ZD, and together imply ([231]). ■ 

The next lemma is proved assuming that Theorem 12.21 holds. (For the proof of 
Theorem 12.21 see Subsection 15. 2p . 

Lemma 3.8 There exists s > not depending on n such that if s{l, 1) > s, then 
as n ^ 00. 

Proof. Let m € Ha U Hh, be such that 

P{M = m,N gP)> 0. (3.57) 

Let be the event that -E^opg holds on the subset I{m) ■ e + rh. (The set /(m) 
got defined in the proof of Lemma 13. 4p . More precisely, E"^ is the event that for all 
i,j G lim), such that j — i > n^'^, we have that 

L{rh + je) — L(rn + ie) > 0.01 • \ j — i\. 

We find that 

^siope = n -E^r^' (3.58) 

m 

where the intersection, above, right hand side is taken over all m G Ha U H^, such 
that (j3.57p holds. Note that there exists a constant c/ (not depending on n) such 
that for all n there are less than c/ • n-^ points in the set Hence there are also less 
than cj ■ vectors rh G Ha U i^fe satisfying (j3.57p . It follows that in the intersection 
on the right side of (j3.58p . there are less than c/ • n^ terms. Now, (j3.58p implies 
that 

P{EZp.)<Y.P^^^^)^ (3.59) 
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where the sum on the right side of the last inequality is taken over all fh G Ha U Hh, 
such that (|3.57p holds. There are less than c/ • terms in the sum on the right 
side of (|3.59|) . Hence to prove the lemma, we only need a negative exponential 
upper-bound for P{E^) which does not depend on rh. 

Let A" denote the event "that j4" holds for X{m + ie)" . More precisely, Af is the 
event that the following two conditions hold: 

31 1 

P{L{m + + l)e) - L{m + ie) = l\X{m + ig),Y)>—--- ei, (3.60) 

P{L{m + + l)e) - L{m + ie) = -l\X{m + ie),Y) < ^ + ei. (3.61) 

Note that the difference between L{m + (i + l)e) and L{m + (i + l)e) is at most 
one. We assume that the constant ei < 7/32. The inequalities p.60p and p.6ip 
give then that for every (x,y) G A'^, we have 

E[ L{m + + l)e) - L{m + ie) = l\X{m + ie) = x,Y = y) > ^ . (3.62) 

From a positive bias like in (|3.62p . one can hope to prove that the event E^ holds 
with probability one minus an exponentially small quantity. The only problem is 
that inequality (j3.62p holds only for {x,y) £ A^. If we condition on njg7(^)A", we 
introduce complicated dependencies so that we can no longer use large deviation 
results for martingales. The trick is to introduce help- variables Yi. When, Af holds 
let 

Yi := L{m + {i + l)e) - L{m + ie), 

otherwise let Yi := 1. We have that Yi is a{Y,X(rn + je)\j < i)-measurable. Also, 
because of (I3.62p . we have that almost surely, 

E[Yi\Y,X{m+je)]>^. (3.63) 

Let E^Y be the event that for all i,j£ -^(w-), such that j — i > n^-^ , we have that 



Y,Yk>OM-\j -i\. 

k=i 

Note that when njg/(^)yl" holds, then the events E^ y identical. Hence, 

and hence 

P{E-^) < P{Eiy) + J2 ^(^D- (3.64) 

i£l{rn) 

Note that X{m + ie) has distribution C{X\N = m + ie). This implies that 
p(^nc) ^ P(A"^n{iV = m + ze}) ^ P{An 



P(N = m + ie) P{N = fh + ie) 
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Using the last inequality with ()3.4ip . we obtain 
The last inequality implies: 

Pi^T) < CI ■ n^+'''P{A'''). (3.65) 

j£/(m) 

Classical large deviations and the inequality ()3.63p show that 

PiE'iy) < e-''^-''°\ (3.66) 

where A;y > is a constant independent of n. Next, (|3.64p and ()3.65p together 
imply that 

PiElt^) < e-'^^-"" ' + CI ■ n^+''^P{A''^). (3.67) 
We can now plug (|3.32p into (|3.67p to obtain 

The upper-bound in the last inequality above is exponentially small in n^'^. Hence 
by ()3.59p . we get that -P(-E'2ope) ™ust also be exponentially small in nP'^. m 



4 Combinatorics 

We already mentioned that P{Xi = 1) = 1/5. Let e > be a small quantity not 
depending on n. 

Let Bq be the event that in both X and Y there are about 0.5n ones. More precisely, 
Bq is the event that 



i=l 



and 



i=l 



n 
2 

n 
2 



< 



en 
16 



< 



en 
16 



both hold. Let B^ be the event that any optimal alignment of X and Y contains 
at least ^ — en/8 pairs of aligned ones. 



Lemma 4.1 Assume that s{l, 1) and e > satisfy (14. 3p below. Then, 

BS C B"^ 

for all n. 



(4.1) 
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Proof. Let vi be the alignment of X and Y which ahgns only ones and as many 
as possible. Let Si denote the score obtained by aligning X with Y via v. When, 
Bq holds, we obtain 

5i > • (0.5n- en/16), 

and therefore 

Ln>s{l,l)-{0.5n- en/16). (4.2) 

We will assume that 

s(l,l)>-. (4.3) 
e 

The condition (j4.3p implies 

(0.5n - en/16) • s(l, 1) > (0.5n - en/8) • s(l, 1) + n, 

which, when applied to (j4.2j) gives 

L„ > (0.5n - en/8) • s(l,l) +n. (4.4) 

Recall that s(0, 0) = 1. Hence, if the texts X and Y consist only of zeros the 
maximum score would be equal to n. This also implies that n is an upper bound for 
the contribution made by the aligned zeros to the score of any alignment. Assume 
now that v is an alignment which aligns no more that 0.5n — en/8, pairs of ones. 
Let Sy denote the score of v. When Bq holds, using the bound for the contributions 
of the zeros in the score, we find that 

5„ < (0.5n - en/8) • s(l,l) +n. (4.5) 

Together, (j4.4p and (j4.5p imply that v is not an optimal alignment. This implies 
that when Bq holds, then any optimal alignment contains at least 0.5n — en/8 pairs 
of aligned ones. In other words, Bq implies Bf, when ()4.3p holds. ■ 

Recall that the score s(l, 1) is taken to be large. This insures, that typically 
there is a large proportion of the total number of ones, which get matched with 
a one by the optimal alignment. We introduce a special notation for alignments, 
which is convenient to describe alignments which align most ones with ones. Let us 
start with a numerical example: 

Take the finite sequence of pairs of natural numbers 

(0,0), (0,1), (1,0). 

According to our notations, this sequence represents an alignment which does the following: 
-First (0, 0) means that we align the first one of X with the first one of Y without skipping 
any one. 

-The second pair (0, 1) indicates that after the first pair of aligned ones, we skip a one in 
the Y sequence and no one in the X sequence. 

-The third pair (1, 0) means that after the second pair of aligned ones, we skip a one in X 
and no one in Y. 
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Take for example the sequences X = 0101011 and Y = 0001111. The alignment v 
((0, 0), (0, 1), (1, 0)) is then equal to: 










1 





1 





1 


1 











1 


1 


1 






1 



Recall that the score for aligning a zero with a one is zero: s(0, 1) ~ 0. 

Let denote the set of alignments which align exactly k pairs of ones with each 
other and such that there is a proportion of less than e/2 ones not belonging to 
pairs of aligned ones. (The ones which are not aligned with ones are counted up to 
the last pair of aligned ones.) Hence, with our representation of alignments of pairs 
of ones as sequences of couples of natural numbers we find 

V'' := {{vi,V2,...,Vk)\vi,V2,.. . G N X N>i| + \v2\ + ... + \vk\ < ek/2} , 

where if v = (a, b) G N^, \v\ := a + b. Let 

V := Ufc>p.„y'^ 

where p* := 0.5 — e/8. 

Let i?2 be the event that any optimal alignment of X with Y is contained in V. 
In other words, B2 holds when for every alignment such that = Ln, we have 
V € V. (Here 5„ is the score obtained by aligning X with Y and using for this the 
alignment v). 

The next lemma shows that Bq and Bf together imply B^: 



Lemma 4.2 When e > satisfies (|4.8p below, we have that 

5J n B'^ C B^, (4.6) 

for all n. 

Proof. Let v be an alignment. We say a one is matched by v if it gets aligned 
with another one by v. When it is clear from the context which alignment v we 
are talking about, we simply say that a one is matched. If Bq and B^ both hold, 
there are at most (en/ 16) + (en/8) non- matched ones in each text X and Y for any 
optimal alignment v. On the other hand, B^ insures that at least ^ — en/8 ones 
are matched in each text with a one from the other text by any optimal alignment 
V. This insures that the proportion of non-matched ones by the total number of 
matched ones is smaller or equal to 



en \ en o / 1 

T6 + T _ 3e / 1 



0.5n-en/8 8 \l-e/A 
Hence by choosing e > small enough, we get 



(4.7) 



' ^<i. (4.8) 



1-e/Aj - 2 

We assume henceforth that the inequality (j4.8p holds. This implies that the optimal 
alignment has a proportion of non-matched ones to matched ones smaller than e/2. 
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Adding to this, that by Bf there are at least | — en/8 pairs of aligned ones, we get 
that any optimal alignment is in V. Hence, B2 holds. We have just proved that Bq 
and B" together imply B2, when e > is small enough. ■ 

For V G V^, let {'K-u{i),v-i,{i)) be the indexes of the i-th pair of ones aligned by v. 
Hence, if 7r^(i) = j and ^'^;(^) = k, then all of the following holds: 
-Xj gets aligned with Y^. 
-Xj = Yk = l 

-The i-pair of aligned ones (by v) is Xj and Yk- 

For example, in the previous numerical example we have that 77^(1) = 2, 7r^(2) = 4 and 
7r„(3) = 7. Furthermore, ^'^(1) = 4, t'^(2) = 6 and t'^(3) = 7. 

Let V be an alignment. Recall that a one that gets aligned by v to another one 
(instead of to a gap or a zero), is called matched by v. 

Let Nr,{v) denote in the alignment v, the total number of subsequent pairs of ones 
in X satisfying all of the following conditions: 

-The ones are both matched, that is aligned by v with a one from Y. 
-Between the pair of ones in the X text there are only zeros. More precisely, we 
require that there is a block of five zeros, between the pair of ones in the X text. 
-The pair of ones in the Y text with which our pair from X is aligned, should contain 

only zeros in between them. 

More precisely: for v G V'', we define: 

N5{v) :=\{i< k\vi = (0, 0), 7r„(i + 1) - 7r^(i) = 6} | 

where v := {vi,V2, ■ ■ ■ ,Vk). 

Let A^5<(f) denote in the alignment v, the total number of subsequent pairs of ones 
in X satisfying all of the following conditions: 

-The ones are both matched, that is aligned by v with a one from Y. 

-Between the pair of ones in the X text there are only zeros and we require that 

there are exactly five zeros. 

-The pair of ones in the Y text with which our pair from X is aligned, should contain 
only zeros in between them and contain strictly less than 5. 
More precisely: 

N5<^{v) ■.= \{i < k\vi = (0,0),7r(i + l) -7r(i) =6,i/„(i + l) -z/^(i) < 6} |. 

Let C" be the event that for all v e V which is an optimal alignment, we have that 

iV5<(^) > 31 _fi 
N5{v) - 32 4 ■ 

Let i?3 be the event that in the sequence X there are at least ((1/32) — e/16)n 

blocks of zeros of length five. 

Let P5{v) be the conditional probability on X, that when we pick a block of five 
zeros at random in X, this block happens to satisfy the following two conditions: 
1) The block is contained between two consecutive matched ones. (Matched by the 
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alignment v.) This means that between the two matched ones there is our block of 
length five and nothing else. 

2) The pair of ones in the Y-text to which the pair of consecutive ones are aligned 
by V contains only zeros in between them and strictly less than five of them. 
In other words, p^iv) is the conditional probability (conditional on X) that when 
we pick at random a block of zeros of length five in X, there exists i < k, such 
that the randomly selected block is equal to [TTy{i — 1) + 1, Tr^{i) — 1] and all of the 
following holds: 

TTy{i) - TTy{i - 1) = 6, 

rivi'i) - Vvii - 1) < 6 

and \vi\ = 0, where 

V = {vi,V2,...,Vk) G (N X N)^ 



Recall that the total number of blocks with five zeros in X is denoted by n^. Fur- 
thermore, we select a block of length five with equal probability among all blocks 
of five zero in X. Hence, each block of five zeros in X has a conditional probability 
of l/ris to get selected. There, are A^5< blocks of five zeros in X satisfying our 
conditions. Hence the conditional probability P5{v) is equal to: 



P5{V) 



N5<{v) 



Let us return to a numerical example. Let X = 101010101000001 and let Y = 101000110001. 
Let V be the alignment 
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In X there is one block of five zeros. Hence, 715 = 1. Furthermore this block is contained 
directly between matched ones and the corresponding ones in the l^-text contain between 
them only zeros. Hence, our block of five zeros counts towards N5{v). We have N^lv) = 1. 
The block of five zeros in X is matched with a block with three zeros in Y, hence with a 
block with strictly less zeros. Thus, the block with five zeros is also counted towards N^^: 
and we find N^^ = 1. Thus, the conditional probability is equal to 1. 



Let be the event that 



P5{v) > 



31 
32 



2 



for every optimal alignment v ^ V. 

Lemma 4.3 For e > and ei > satisfying the inequality (I4.14p below, we have 

n n 5J n C" c ^" (4.9 

for all n. 
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Proof. We have for any alignment v GV, that 



m N^^v) 77,5 

When, the event C" holds, then, for every v £ V, we have 

N,{v) -32 

Combining the last inequality with the equality (j4.10p , yields 



(4,10) 



"-('')>fE-,,/4ViM!^. (4.11, 



775 \32 / 775 

Now, Bq and together imply B2, hence, when Bq and B^ both hold, then every 
optimal alignment is in V. Thus, (j4.1ip also holds for every optimal alignment v. 
If Bq and i3" both hold, there are at most (e?7,/16) + {en/8) non-matched ones in 
each text X and Y for any optimal alignment v. This also implies that the number 
of blocks of five zeros in X which do not satisfy the criteria to be counted towards 
Nr,{v) is at most 2[(e77/16) + (e?7,/8)], (for every optimal alignment v). This implies 
that for every optimal alignment v, when Bq and Bf both hold, then 

N^iv) > 775 - 3r7e/8. 

The last inequality with (j4.1ip implies 



775 \32 J \ 8775 

for every optimal alignment v. When i^g holds, we have that 

775 > ((1/32) - e/16)77, 

and hence 

1-^>1 



8775 - 1/4 - e/2' 
Using the above inequality in (|4.12p gives: 



775 ~ V32 /V 1/4 -e/2 

Note that 

.-oV 1/4-6/2; 

Hence for any ei > fixed, if we chose e > small enough, (depending on ei), we 
get 

/^31 ,\ / 3e \ 31 ei 
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We henceforth assume that the inequahty (j4.14p holds. This together with (j4.13p 
yields: 

iV5<W > 31 _ ei 
ns - 32 2 ■ 

Hence E"^ holds. We have just proved that if ei > and e > are chosen so that 
(imH holds, then B^}, B^, Bl^ and C" together imply the event E"". m 

Let Ni{v) denote in the alignment v, the total number of subsequent pairs of 
ones in X satisfying all of the following conditions: 
-The ones are both matched, that is aligned by v with a one from Y. 
-Between the pair of ones in the X text there is exactly one zero and nothing else. 
-The pair of ones in the Y text with which our pair from X is aligned, should contain 
only zeros in between them. 
More precisely: for v G V^, we define: 

Ni{v) ■.= \{i< k\vi = (0,0),7r„(i + l) -vr^i) =2}|, 

where v := {vi,V2, ■ ■ ■ ,Vk)- 

Let Ni^{v) denote in the alignment v, the total number of subsequent pairs of ones 
in X satisfying all of the following conditions: 

-The ones are both matched, that is aligned by v with a one from Y. 

-Between the pair of ones in the X text there are only zeros and we require that 

there is exactly one zeros. 

-The pair of ones in the Y text with which our pair from X is aligned, should contain 
only zeros in between them and contained two or more. 
More precisely: 

Ni^{v) :=\{i< k\vi = (0, 0), 7r(i + 1) - 7r(i) = 2,iy^{i + 1) - > 2} |. 

Let B2 be the event that in the sequence X there are at least ((1/4) — e/16)n blocks 
of zeros of length two. 

Let pi{v) be the conditional probability on X, that when we pick a block of one 
zeros at random in X, this block happens to satisfy the following two conditions: 

1) The block is contained between two consecutive matched ones. (Matched by the 
alignment v.) This means that between the two matched ones there is our block of 
length one and nothing else. 

2) The pair of ones in the Y-text to which the pair of consecutive ones are aligned 
by V contains only zeros in between them and at least two of them. 

In other words, pi{v) is the conditional probability (conditional on X) that when 
we pick at random a block of zeros of length one in X, there exists i < k, such 
that the randomly selected block is equal to [iTy{i — 1) + 1, 71^(1) — 1] and all of the 
following holds: 

7r„(z) - 7r„(i - 1) = 2, 
Vvii) - ilv{i - 1) > 2 

and \vi\ =0, where 

V = {vi,V2,...,Vk) G (N X N)^ 
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Recall that the total number of blocks made of one zeros in X is denoted by ni. 
Hence the conditional probability pi{v) is equal to: 

, , A^i>(^) 

Piiv) ■= • 

ni 

Let us go back to our numerical example. Again, let X = 101010101000001 and let 
Y = 101000110001. Let v be the alignment 
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There are three blocks of zeros of length one in X. Hence, ni = 3. 

Among the blocks of one zero in X only two are directly comprised between matched ones. 
Hence, Ni{v) = 2. 

One of the two "suitable" blocks of one zero in X is matched with a block of zeros in Y 
strictly longer than 1. Hence, A^i> — 1 and pi{v) — 1/3. 



Let F"' be the event that 

Pi{v) >--—. 
for every optimal alignment v £ V. 

Let Z?" be the event that for al\ v £ V which is an optimal alignment, we have that 

iVi>(^) > 
Ni{v) - 4 4 ■ 

Lemma 4.4 Assume that e > and ei satisfy the inequality (|4.20p below, then 

B^n^i nz?" c (4.15) 

for all n. 

Proof. We have for any ahgnment v £V , that 



Til ni 
When, the event holds, then, for every v £ V, we have 

A^i>(^) > 
A^i(t;) - 4 4 ■ 

Combining the last inequality with ()4.16p . yields 



(4.16) 



''^>i-y>(\.,,/A.Ml. (4,17) 



Til / ^1 

We have that Bq and Bf together imply ^2 . Hence, when Bq and B^ both hold, 
then every optimal alignment is in V. Hence, (j4.17p also holds for every optimal 
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alignment v. 

If Bq and both hold, there are at most (en/16) + (en/8) non-matched ones in 
each text X and Y for any optimal alignment v. This also implies that the number 
of blocks of one zeros in X which do not satisfy the criteria to be counted towards 
Ni{v) is at most 2 [(en/ 16) + (en/8)], (for every optimal alignment v). This implies 
that for every optimal alignment v, when Bq and Bf both hold, then 

Ni{v) > ni - 3ne/8. 

The above inequality with (I4.17P implies 

^i<(^^) /I , \ f 3ne\ 



ni \4 J \ 8ni 

for every optimal alignment v. When B2 holds, we have that 

ni > ((1/4) -e/16)n, 

and hence 

^ 3ne ^ ^ 3e 



8ni - 2 - e/2 

Note that the expression on the right side of the above inequality goes to 1 as e 
converges to zero, hence (j4.18p becomes: 

^.>w,(i ,/4V(i-^y (4.1.) 



m - V4 y V 2 -e/2 

Note that 

lim f 1 ^) = 1. 

Hence for any ei > fixed, if we chose e > small enough, (depending on ei), we 
get 

1 , \ / 3e \ 31 ei , , 

We henceforth assume that the inequality (j4.20p holds. This together with (j4.19p 
yields: 

Ni>{v) > 1 _£i 
ni - 4 2 ■ 

Hence holds. We have just proved that if ei > and e > are chosen so that 
(1^:20]) holds, then B^, Bf, B^ and D"- together imply the event F". ■ 



Let us return to a numerical example. Let X = 101010101000001 and let Y = 101000110001. 
Let V be the alignment 
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Note that there are two consecutive aligned ones with one zero in between in the X-part. 
Hence Ni{v) = 2. Note, that the third and fifth one in the string X are consecutive aligned 
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ones with one zero between them. But there is also a non-ahgned one between them, so 
they do not count towards Ni{v). Instead the first consecutive ahgned one counting towards 
Ni{v) is given by the first and the second one in X. Then, the second and third one in X 
constitutes such consecutive couple of ones counting towards Ni (w) . 

Among the two pairs of consecutive aligned ones with one zero in between, there is one 
which has strictly more than 1 zero in between the ones in the Y part. Hence, A^i> = 1. 
There is here one consecutive pair of aligned ones with five zeros in between. It is given by 
the fifth and sixth one in X. Hence, iV5(w) — 1. 

The only pair of consecutive aligned ones with five zeros in between in the X part has 3 
zeros in between in the Y part. Hence, iV5< = 1. 

Recall that is the event that X and Y are such that the inequalities (|2.5p 
and p.6p are satisfied. 

The next combinatorial lemma of this paper is: 

Lemma 4.5 When ei > satisfies the inequality ()4.24p below, then 

n C (4.21) 

holds for every n G N. 

Proof. We say that a block of zeros in X is aligned with another block of zeros in 
y by f , if these blocks are in between consecutive mutually aligned ones. 

Let us give an example. Let X = 101 and Y = 1001. Let v be the alignment 
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In this example X has one block of zeros. This block has length one. The text Y has also 
one block of zeros. This block has length two. We say that the block of zeros in X gets 
aligned by v with the block of zeros in Y . 

Recall that to obtain L„ from L„, we pick a block of five zeros at random in X and 
reduce its length by one. Then we add a zero to a randomly chosen block of one 
zero in X. The modified text is denoted by X. The optimal score between X and 
Y is L. 

For any optimal alignment v, we have: 

When the selected block of five zeros in X is aligned by v with a block of length 
strictly less than five, then the score is not reduced. When, on top of that, the extra 
zero is added to a block of length one which is aligned by v to a block of at least 
two zeros, than the score is increased by one. The block of length five and the block 
of length one are chosen independently from each other, so that we find: 

P{Ln -K = 1\X, Y) > p^iv) ■ pi{v), (4.22) 

for any optimal alignment v. When, and both hold, then, for any optimal 
alignment v we have: 

, , fl €i\ /31 eA 1 31 /35 eA x 
. > ( J - -j (- - - j = J . - - (- + - j . (4.23) 
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For ei > small enough, we have that 

We assume that ei is chosen small enough so that (j4.24p holds. Then, (|4.22p and 
(j4.23p together imply that 

1 31 

P{Ln -K = l\X,Y)>-.--e,. (4.25) 

Let V be an optimal alignment. We saw that when the selected block of length 
five is aligned with a block of length strictly less, than five then the score can not 
decrease, i.e.: 

Ln ~ > 0. 

It follows that for any optimal alignment v, we have 

P{Ln-Ln = -l\X,Y)<l-p^{v). (4.26) 
When En holds, we have 

P5(t^)>§-|. (4.27) 
Together, the inequalities ()4.26p and (j4.27p imply 

P{Ln-Ln = -l\X,Y)<^ + ei. (4.28) 

We have just proved that when i?" and both hold, then (j4.25p and (j4.28p both 
hold. In other words, we proved that and jointly imply ^4", when ei > 
satisfies (g^D- ■ 



5 Probability bounds 
5.1 The bounds 

We start with some useful bounds: 

Lemma 5.1 There exists 70 > 0, independent of n, such that 

P{B^) > 1 - e~^°". (5.1) 

(Note that 70 depends on e.) 

Proof. The sequence X, resp. Y is iid. The probability that Xi = 1, resp. Kj = 1 
is equal to ^. Hence, by large deviation, the probability that the average X]?^*/^ 
is different from its mean by more than e/16 is exponentially small in n. ■ 

The next bound is: 



33 



Lemma 5.2 There exists 73 > 0, independent of n, such that 



P{B^) > 1 - e-^3n_ (5,2) 

(Note that 73 depends on e). 

Proof. The blocks in X are iid. The probabihty that a block has length five is equal 
to 1/32. Large deviation applies to the probability that the proportion of blocks of 
zeros which has length five is below expectation by e/16. Hence that probability is 
exponentially small in n. ■ 

Similarly, 

Lemma 5.3 There exists 74 > 0, independent of n, such that 

P{Bl) > 1 - e"^*". (5.3) 

(Note that 74 depends on e). 

Proof. Essentially the same as the proof of the previous lemma above. ■ 
The next lemma gives an upper bound on the number of elements in the set V . 
Lemma 5.4 We have that 

where H is the entropy function. 

Proof. First we need to determine which entries are not zero. There are at most 
ek/2 non-zero entries, which have to be chosen from 2k entries. Hence this gives a 
total of 

possibilities. Then, we have to chose how large each entry is. This means that we 
have to distribute among the non-zero entries (which we already determined in the 
previous step) a total of efc/2 integer points. This is the same as finding an integer 
partition of the interval 2"^*^/^. There are at most 2''^/'^ integer partitions of the 
interval [0,eA;/2]. This with ()5.5p means that contains no more than 

elements. ■ 
Eventually, we have 

Lemma 5.5 Assume that e > and ei are such that (|5.8p and (|5.13p both hold. 
Then, we have that: 

P(C") > 1 - e"^^'^, (5.6) 
where 7c is a positive constant independent of n, (hut depending on e). 
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Proof. Let be the event that if N^{v) > n/33, then C" holds. In other words 

c." = (c"n{A.,w>^})u{A.,(„)<^}. 

Note that when holds there are at least (1/32 — e/16)n blocks of five zeros in 
X. When Bq and i?" both hold, then we argued that there at most 6en/16 ones 
not matched total in both sequences X and Y for any optimal alignment v. 
This gives that when Bq, B^ and B^ all hold, then for any optimal alignment v, 
we have that 

N5{v) > (1/32 - 7e/16)n. (5.7) 
For e > small enough, we have 

1 7e 1 , , 

> — . 5.8) 

32 16 - 33 ^ ^ 

From here on we assume that (|5.8p holds, so that (j5.7p implies that 

n 



Hence 



and hence 



We have that 



B^nB^nBr}c[N,{v)>^}, 

{n,{v) < ^} c {B^r u {B^r u {B^r. (5.9) 
ic-ycic:ru{N,iv)<^} 



JTIC 

h ■ 



and hence with (jS.Op and with Lemma 4.1, we find 

(C")'= C CT^B'S^UBl 
The last inclusion implies that 

P(C7"^) < P(Cr) + ^"(^0^) + ^(^D- 

We already prove exponentially negative bounds for P{Bq'^) and for P{B^'^). Hence 
it only remains to prove an exponential negative upper bound for P(C^'^). 
Let Zi,Z2, ■ ■ ■ be a sequence of iid geometric random variables with parameter 1/2. 
Let Wi be the indicator variable which is equal to 1 if Zi < 5. Hence, P{Wi = 1) = 
31/32. We have that: 

P(cr) < t i^'i • ^ (ii^±i^^^^ < I - ./4.) ,5.10) 



fc=n/33 

By large deviation we find 
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for some constant 7(ei). 

Combining ()5.4p and (jS.lip . we obtain 

\ + W2+ +W, ^ 31 _ ^^^\ ^ _ ^_,,(,,)_ ^^^^2) 

K 33 / 

Hence ()5.10p and (j5.12p together provide an exponential upper bound for P{Cl^^) 
as soon as the following inequality: 

i/(e) + e - 7(ei) < (5.13) 

is satisfied. Note that 

lim (H(e) + e) = 0, 

whilst 7(ei) > 0, for every ei > 0. This implies that for any ei > fixed, (|5.13p 
holds, when taking e > small enough. ■ 
In complete similarly to the previous lemma we have: 

Lemma 5.6 Assume that e > and ei are such that (15. Sp and (15.13P both hold. 
Then, there exists 7^ > independent of n such that 

P{D'') > 1 - e-^""". (5.14) 
Proof. This proof is similar to the previous one and so is omitted. ■ 




5.2 Overview 

In Section 3 we proved that VarL„ is of order n. For this we assumed that Theo- 
rem 2.2 holds. So it still remains to prove Theorem 2.2. First, let us mention that 
the order n for Var[L„] follows from two things: 

a) An := Ln — Ln needs to have a positive bias. More precisely, we want that for 
any (x, y) G j4" and any n, we have that 

E[Ln- Ln\X = x,Y = y]> K, 

where -ftr > is some positive constant. 

b) The probability P{A"') needs to be close to one. More precisely we want the 
quantity 1 — P(j4") to be no more than stretched negative exponential. 

Let us first mention problem a). Note that between L„ and L„ the score can change 
by at most one. In other words, 

P(L„-L, G {-1,0,1}) = 1. (5.15) 

From the equation (I5.15P and with the help of (12. 5p and (12. 6p . we find: 

31 1 23 

E[Lrr - L„ = 1|X = X, y = y) > — - - - ei = — - ei. (5.16) 
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Taking ei > small enough so that 



. < i (5.17) 



insures the positive bias. 



Let us next discuss problem b). In the previous section, we proved that the following 
inclusions hold: 

Bq C B^, when the condition (|4.3|) holds. 
Bl}nB{' C B^, when the condition (filH]) holds. 

n n 5f n C" C S*", when the condition (imj) holds. 
Bp" n -Bf n 5J n Z?" C F", when the condition K20\\ holds. 
F" n F" C A", when the condition (jiTMl) holds. 

These inclusions imply, when the conditions (fOj) . (fO]) . (fiT^ . ([OO]) and (fi:2i|) ah 
hold, that, 

n n 5J n F" n c A", 

and hence 



< P(Bo"^) + F(Br) + ^'(^r) + F(C7"^) + P(F)"^). (5.18) 

The inequality (jS.lSp implies that P(j4"^) is negatively exponentially small in n, as 
soon as we have negative exponential bounds for P{Bq:'^), P{B^'^), P{B2'^), P{C^^) 
and P(F)"^). 

The probabilities P{Bq^), P{B'^^) and P{B2'^), only depend on e. For any value of 
e > 0, the inequalities (|5.1[) . (15. 2p and (|5.3p provide negative exponential bounds for 
P{Bq:'^), P(B^'^) and P{B2^)- Hence, there is no special condition on e > and ei > 
in order to ensure that P{Bq'^), P{B^'^) and P{B2^) are negative exponentially 
small in n. 

We also proved exponential negative upper bounds for P{C^^) and P[D'^^). These 
bounds are the inequalities ()5.6p and ()5.14p . However these bounds only hold if e 
and ei satisfy the conditions (|5.8p and (j5.13p . 

To prove that P{A^'^) is exponentially small it thus remains to prove that there 
exists e and ei satisfying all the following: 

a) The condition (15.17p . which gives L„ — L„ its conditional bias (j5.16p . 

b) All the conditions for the inclusions. These are the inequalities (|4.3p . (|4.8p . (j4.14p . 
(lt2n]) and Km . 

c) The conditions for the exponential upper bounds for P[C"''^) and P(D"'^). These 
are the inequalities (|5.8p and (j5.13p . 

To see that there exist e, ei > satisfying all the above conditions simultaneously, 
note that these conditions can be classified in three types: 

type I: conditions involving only ei. These conditions all hold for ei > small 
enough. 

Type H: conditions involving ei and e. All these conditions are such that for any 
ei > fixed, they hold as soon as e > is small enough. 

Type HI: conditions involving only e. They all hold as soon as e is taken small 
enough. 



37 



It is now easy to see that there exists ei > and e > such that all the conditions 
([5T71) . ([TSD . m . (ITOD . (OOD . (Oil) . dSSl), and (IET3]) simultaneously hold. For 
this chose first and ei > small enough so that all equations of type I are satisfied. 
Then choose e > small enough so that all conditions of type II and type III are 
satisfied. Summarizing we have just shown that: There exist e, ei > and ci, s > 
not depending on n, such that if 1) > s, then 

PiA'"') < e^i", (5.19) 

for all n. 
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